NSF PAR Search | NSF Public Access Repository



Fast and accurate genome-wide predictions and structural modeling of protein–protein interactions using Galaxy

https://doi.org/10.1186/s12859-023-05389-8

Guerler, Aysam; Baker, Dannon; van den Beek, Marius; Gruening, Bjoern; Bouvier, Dave; Coraor, Nate; Shank, Stephen D.; Zehr, Jordan D.; Schatz, Michael C.; Nekrutenko, Anton (December 2023, BMC Bioinformatics)

Abstract Background: Protein–protein interactions play a crucial role in almost all cellular processes. Identifying interacting proteins reveals insight into living organisms and yields novel drug targets for disease treatment. Here, we present a publicly available, automated pipeline to predict genome-wide protein–protein interactions and produce high-quality multimeric structural models. Results: Application of our method to the Human and Yeast genomes yields protein–protein interaction networks similar in quality to common experimental methods. We identified and modeled Human proteins likely to interact with the papain-like protease of SARS-CoV2’s non-structural protein 3. We also produced models of SARS-CoV2’s spike protein (S) interacting with myelin-oligodendrocyte glycoprotein receptor and dipeptidyl peptidase-4. Conclusions: The presented method is capable of confidently identifying interactions while providing high-quality multimeric structural models for experimental validation. The interactome modeling pipeline is available at usegalaxy.org and usegalaxy.eu.
Full Text Available
Reproducible and accessible analysis of transposon insertion sequencing in Galaxy for qualitative essentiality analyses

https://doi.org/10.1186/s12866-021-02184-4

Larivière, Delphine; Wickham, Laura; Keiler, Kenneth; Nekrutenko, Anton (December 2021, BMC Microbiology)
Abstract Background: Significant progress has been made in advancing and standardizing tools for human genomic and biomedical research. Yet, the field of next-generation sequencing (NGS) analysis for microorganisms (including multiple pathogens) remains fragmented, lacks accessible and reusable tools, is hindered by local computational resource limitations, and does not offer widely accepted standards. One such “problem area” is the analysis of Transposon Insertion Sequencing (TIS) data. TIS allows probing of almost the entire genome of a microorganism by introducing random insertions of transposon-derived constructs. The impact of the insertions on the survival and growth under specific conditions provides precise information about genes affecting specific phenotypic characteristics. A wide array of tools has been developed to analyze TIS data. Among the variety of options available, it is often difficult to identify which one can provide a reliable and reproducible analysis. Results: Here we sought to understand the challenges and propose reliable practices for the analysis of TIS experiments. Using data from two recent TIS studies, we have developed a series of workflows that include multiple tools for data de-multiplexing, promoter sequence identification, transposon flank alignment, and read count repartition across the genome. Particular attention was paid to quality control procedures, such as determining the optimal tool parameters for the analysis and removal of contamination. Conclusions: Our work provides an assessment of the currently available tools for TIS data analysis. It offers ready-to-use workflows that can be invoked by anyone in the world using our public Galaxy platform (https://usegalaxy.org). To lower the entry barriers, we have also developed interactive tutorials explaining details of TIS data analysis procedures at https://bit.ly/gxy-tis.
Full Text Available
Correction to ‘The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2022 update’

https://doi.org/10.1093/nar/gkac610

Afgan, Enis; Nekrutenko, Anton; Grüning, Bjórn A.; Blankenberg, Daniel; Goecks, Jeremy; Schatz, Michael C.; Ostrovsky, Alexander E.; et al. (July 2022, Nucleic Acids Research)

Full Text Available
Scalable, accessible and reproducible reference genome assembly and evaluation in Galaxy

https://doi.org/10.1038/s41587-023-02100-3

Lariviere, Delphine; Abueg, Linelle; Brajuka, Nadolina; Gallardo-Alba, Cristóbal; Grüning, Bjorn; Ko, Byung June; Ostrovsky, Alex; Palmada-Flores, Marc; Pickett, Brandon D.; Rabbani, Keon; et al. (March 2024, Nature Biotechnology)

Full Text Available
GYAN: Accelerating Bioinformatics Tools in Galaxy with GPU-Aware Computation Mapping

https://doi.org/10.1109/IPDPSW52791.2021.00037

Gudukbay, Gulsum; Gunasekaran, Jashwant Raj; Feng, Yilin; Kandemir, Mahmut T.; Nekrutenko, Anton; Das, Chita R.; Medvedev, Paul; Gruning, Bjorn; Coraor, Nate; Roach, Nathan; et al. (June 2021, 2021 IEEE International Parallel and Distributed Processing Symposium Workshops (IPDPSW))
Full Text Available
Ready-to-use public infrastructure for global SARS-CoV-2 monitoring

https://doi.org/10.1038/s41587-021-01069-1

Maier, Wolfgang; Bray, Simon; van den Beek, Marius; Bouvier, Dave; Coraor, Nathan; Miladi, Milad; Singh, Babita; De Argila, Jordi Rambla; Baker, Dannon; Roach, Nathan; et al. (October 2021, Nature Biotechnology)
Full Text Available
The Galaxy platform for accessible, reproducible and collaborative biomedical analyses: 2022 update

https://doi.org/10.1093/nar/gkac247

Afgan, Enis; Nekrutenko, Anton; Grüning, Bjórn A.; Blankenberg, Daniel; Goecks, Jeremy; Schatz, Michael C.; Ostrovsky, Alexander E.; Mahmoud, Alexandru; Lonie, Andrew J.; Syme, Anna; et al. (April 2022, Nucleic Acids Research)

Abstract Galaxy is a mature, browser accessible workbench for scientific computing. It enables scientists to share, analyze and visualize their own data, with minimal technical impediments. A thriving global community continues to use, maintain and contribute to the project, with support from multiple national infrastructure providers that enable freely accessible analysis and training services. The Galaxy Training Network supports free, self-directed, virtual training with >230 integrated tutorials. Project engagement metrics have continued to grow over the last 2 years, including source code contributions, publications, software packages wrapped as tools, registered users and their daily analysis jobs, and new independent specialized servers. Key Galaxy technical developments include an improved user interface for launching large-scale analyses with many files, interactive tools for exploratory data analysis, and a complete suite of machine learning tools. Important scientific developments enabled by Galaxy include Vertebrate Genome Project (VGP) assembly workflows and global SARS-CoV-2 collaborations.
Full Text Available
